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We examine the uncertainty of perturbative QCD factorization for hadron structure functions in 
deep inelastic scattering at a large value of the Bjorken variable xb- We analyze the target mass 
correction to the structure functions by using the collinear factorization approach in the momentum 
space. We express the long distance physics of structure functions and the leading target mass 
corrections in terms of parton distribution functions with the standard operator definition. We 
compare our result with existing work on the target mass correction. We also discuss the impact of 
a final-state jet function on the extraction of parton distributions at large fractional momentum x. 
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I. INTRODUCTION 



Much of the predictive power of perturbative Quan- 
tum Chromo Dynamics (pQCD) is contained in factoriza- 
tion theorems and in the universality of non-perturbative 
hadronic matrix elements Predictions follow when 
processes with different hard scatterings but the same 
matrix elements are compared. In the case of leading 
power contributions, the universal matrix elements are 
interpreted as parton (quark or gluon) distribution func- 
tions (PDFs). With the PDFs extracted from a global 
QCD analysis @, i, Q, pQCD has been very success- 
ful in interpreting and predicting high-energy scattering 
processes. 

However, significant uncertainties still exist in the 
PDFs due to the accuracy of experimental data and to 
unknown higher order corrections to perturbative calcu- 
lations. In particular, the PDFs are least constrained in 
the region where the parton momentum fraction x > 0.5 
for valence quark distributions and cc > 0.3 for gluon and 
sea quark distributions 0, On the other hand, pre- 
cise PDFs are needed for many reasons '5']. For exam- 
ple, the discovery potential of the Large Hadron Collider 
(LHC) on new physics as excess in particle /jet spectrum 
at large momentum requires accurate PDFs at large x 
and large factorization scale /i. Since PDFs at a large /i 
are obtained by solving DGLAP evolution equations with 
input PDFs at a lower factorization scale, and the evo- 
lution feeds the large- a; partons at a lower scale to those 
at a higher scale with smaller momentum fraction x, the 
precision of PDFs at large fj, depends on the accuracy 
of PDFs at large x and low factorization scale. Further- 
more, reliable information on the ratio of d{x)/u{x) as 
X — > 1 could provide very important insights into the 
non perturbative structure of the nucleon [1, 0, B 
and references therein. However, because of the PDFs 
steeply falling shape as a function of ^1, and be- 

cause of the convolution of two PDFs, most observables 



in hadronic collisions do not provide tight enough con- 
straints to the PDFs at large x. On the other hand, inclu- 
sive lepton-hadron deep inelastic scattering (DIS) at large 
Bjorken is a more direct and clean probe of large-x 
parton distributions. Recently, experiments at the Jef- 
ferson Laboratory have produced DIS data at large xb 
with high precision, but at relatively low virtuality of 
the exchanged virtual photon in lepton-hadron collisions 
[13, [m, |12,] . Experiments measure DIS cross sections, or, 
equivalently, the DIS structure functions, not PDFs. In 
order to extract PDFs at large x from these and other 
data at low Q^, it is necessary to have theoretical con- 
trol over power corrections, such as the dynamical power 
corrections (or high twist effects), oc ^ ^c.n /Q^ with the 
non-perturbative scale Aqcd ^ 1/fm the tar- 

get mass corrections (TMC), cx x^rnf^/Q^ with nucleon 
mass rriN and possibly, final-state jet mass correc- 
tions (JMC), oc rnj/Q"^ [1^. These corrections become 
larger and larger as data approach the kinematic limit 
Xb = 1. In this paper, we examine the uncertainties in 
extracting PDFs at low and large xb caused by the 
target mass and jet mass corrections. 

At the leading power, the perturbative QCD fac- 
torization treatment of DIS cross sections neglects all 
1/Q^-type power corrections. However, TMC play a 
somewhat special role. Since the mass of the target 
is a non-perturbative quantity, the partonic dynamics 
of short-distance factors in the QCD factorization for- 
malism should not depend on it. Therefore, for any 
hadronic cross section that can be factorized in pertur- 
bative QCD, the effect of TMC should be implicitly in- 
cluded in the definition of the non-perturbative hadron 
matrix elements, and explicitly accounted for in the kine- 
matic variables of the observables. In this sense, TMC 
are mostly of kinematic origin. At large xb and low 
Q^, the x^TO^/Q^-type TMC can be an important part 
of the measured cross sections, and should be identified 
and removed before we extract the leading power PDFs 
at large x. 
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Following the pioneering work by Georgi and Politzer 
(GP) in as early as 1976 p7|], many papers have been 
written on TMC, in particular, for lepton-hadron DIS. A 
recent review by Schienbein et al. provides a nice sum- 
mary of this effort [IS;]. Most existing calculations use 
the technique of operator product expansion (OPE) to 
resum mj^/Q^ corrections to the structure function mo- 
ments. A strong debate has been centered on the inver- 
sion of the moment formula [H, [H, [13, [HI ll^l . If we 
keep the target mass in the DIS kinematics, the Bjorken 
scaling variable xb for the DIS cross sections or struc- 
ture functions needs to be replaced by the Nachtmann 
variable [23,], ^ = 2x3/(1 + y/TTix^mjjQ^) ^ Xb sls 
"Iat/Q^ 0. If the target mass cannot be neglected at 
low Q^, the Nachtmann variable is less than 1 even 
at Xb = 1. Only if one ignores the xb = 1 kinematic 
threshold, and allows ^ to run up to 1, does the inverse 
Mellin transformation of the structure function moments 
give back the structure functions in a;^ space [20, 21]. 
As a consequence, the inverted structure functions are 
finite in the unphysical > 1 region. The unphysical 
region has been argued to disappear with the inclusion 
of power-suppressed higher-twist terms in the computa- 
tion [18]. Alternatively, many prescriptions have been 
suggested to fix the moments inversion problem, or to 
phenomenologically eliminate the unphysical region, see 
pol [2ll . [22] . None of these prescriptions is entirely satis- 
factory or unique. 

To completely avoid the ambiguities in connection with 
the structure functions moments and their inversion, it is 
natural to investigate the TMC in the momentum space 
without using the OPE and taking the moments. This 
is most easily done in the context of the field theoretic 
pQCD parton model, as pioneered by Ellis, Furmanski, 
and Petronzio in Ref. [l^. Recently, Kretzer and Reno 
applied and compared both approaches in the case of neu- 
trino initiated DIS experiments [13, [1^. In this paper, we 
revisit the TMC in DIS in terms of the perturbative QCD 
collinear factorization approach in momentum space and 
express the long distance physics of structure functions 
and the leading target mass correction in terms of PDFs 
that share the same partonic operators with the PDFs of 
zero hadron mass. In our approach, the momentum space 
structure functions have no unphysical region. Moreover, 
our approach can be generalized to semi-inclusive DIS 
and hadronic collisions, where the OPE is not applica- 
ble. 

In the collinear factorization approach at the leading 
power in the short-distance factors are perturba- 

tively calculated with massless final-state light partons. 
As recently pointed by Collins, Rogers and Stasto in 
Ref. 16], the outgoing parton lines should acquire jet 
subgraphs/functions to have correct kinematics. The in- 
variant mass in the jet subgraph leads to the before men- 
tioned TO^/Q^-type JMC, which are particularly sensitive 
to the large-x b kinematics and the extraction of large-x 
PDFs. In this paper, we discuss the role of the jet func- 
tions in modifying the DIS kinematics in the collinear 



factorization approach. We neglect the soft interactions 
between the beam jet and the final-state jet functions, 
and present a collinear factorization formalism for calcu- 
lating DIS structure functions with a non trivial jet func- 
tion. Based on a toy-model estimate, we argue that the 
JMC has a significant effect on the extraction of PDFs 
when X > 0.6. The connection of the jet function with 
lattice QCD computations of the non-perturbative quark 
propagator is also discussed. 

The rest of our paper is organized as follows. In Sec. [Ill 
we drive the TMC in terms of QCD collinear factoriza- 
tion in momentum space. We explicitly demonstrate that 
our result has no unphysical region for the DIS structure 
functions. We compare our result with TMC predicted 
by other approaches. In Sec. IIIIl we discuss the JMC. 
Finally, we present our summary and thoughts on fu- 
ture extensions in Sec. IIVI In the main text we limit 
the discussion to light partons and the transverse and 
longitudinal structure functions. In the appendices, we 
generalize our formulae. 

II. TARGET MASS CORRECTIONS 

The DIS cross section is determined by the hadronic 
tensor 

W''-{p,q) = dSe-*«-(plJtM(,-)j-(0)|p) , (1) 

where p is the nucleon 4-monientum, q is the virtual 
boson 4-momentum, J'^ is the electromagnetic or elec- 
troweak current, and \p) is the hadron wave function. In 
the impulse approximation the lepton-nucleon interac- 
tion proceeds through the scattering of the virtual boson 
with a parton (quark or gluon) belonging to the nucleon, 
and having 4-momentum k, see Fig. [T] With these 4- 
momenta we can build the following useful invariants: 

2 2 

q 2 2 2 q 

(2) 

The first 3 invariants, namely, the Bjorken variable xb, 
the rest mass oi the nucleon and the vector boson 
virtuality Q^, are experimentally measurable. We call 
them "external invariants". The fourth invariant, x/, 
is the Bjorken variable for a partonic target and is not 
experimentally measurable, so we call it "internal" . 

We work in a class of frames, called collinear frames, 
defined such that p and q do not have transverse momen- 
tum. Then we can decompose p, q and k as follows. 

p^" = p+n^" + 

q^^ = -^p+nf^ + (3) 
2a;p+ 
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FIG. 1: CoUinear factorization of the hadronic tensor in the 
impulse approximation. The top blob represents the interac- 
tion of a virtual boson with a parton computed in pQCD at 
any order in Os- 



The light-cone vectors and n'^ satisfy 

n -n = 1 , 



2 —2 

n = n 



(4) 

and define the Hght-cone plus and minus directions, re- 
spectively. The plus- and minus-components of a 4- vector 
a are defined by 

= a ■ n = a ■ n. (5) 

ff we choose n ~ (l/\/2, Oj^, l/\/2) and n = 
(1/V2,0j.,-1/V2), we obtain = (ao±a3)/V2. The 
transverse parton momentum kx satisfies kr-n = kx-n = 
0. The nuclcon plus- momentum, p"*", can be interpreted 
as a parameter for boosts along the z-axis, connecting 
the target rest frame to the hadron infinite-momentum 
frame. The parton fractional light-cone momentum with 
respect to the nucleon is defined as 

X = k+/p+ , (6) 

and is an internal variable. The virtual boson fractional 
momentum 

e = -^ = ^-^ (7) 

is an external variable, and coincides with the Nacht- 
mann variable [1^. Note that in the Bjorken limit 
{Q'^—^co at fixed xb) C^^b and we recover the standard 
kinematics in the massless target approximation, fn this 
paper, we will consider light quarks u, d, s only and set 
d s = 0. In Appendix IB] we will extend our results to 
heavy quarks. 

CoUinear factorization for the hadronic tensor can be 
obtained by expanding the parton momentum k in Fig. [T] 
around its positive light-cone component, 

fc^ = xp+n^" . (8) 
Correspondingly, we can define the coUinear invariant 



Xf 



2k- 



(9) 



According to the QCD factorization theorem [T] , the nu- 
cleon hadronic tensor can then be factorized as follows: 

W'^-ip^q) 



(10) 



o(aVq') 




> 



Py > m% 



net baryon number 



FIG. 2; DIS in the impulse approximation, for the special 
case of an internal on-shell light parton, fc'^ = 0, relevant to 
coUinear factorization. The current jet has momentum pj and 
the target jet has momentum py- The net baryon number is 
only shown to flow in the target jet (lower part of the graph). 



where Ti^" is the short-distance partonic tensor for scat- 
tering on a parton of flavor /, and ^//n is the leading 
twist parton distribution function for a parton of flavor 
/ inside a nuclcon N, see Fig. [T] For example, the quark 
distribution at leading order in as is defined as 



dz 
2^ 



(11) 



A proper gauge link between the two fermion field oper- 
ators is required to have a gauge-invariant parton distri- 
bution, but drops out if one chooses the light-cone gauge 
n ■ A = 0, where A is the gluon field. Higher orders in 
the Taylor expansion are suppressed by powers of A^/Q^, 
with A a hadronic scale, and contribute to restore gauge 
invariance in higher twist terms We will discuss 

in detail how to obtain such a factorized form in Sec- 
tion [Till In Eq- (|10P- the partonic tensor Ti'^" can be 
computed perturbatively to any order in as , and can de- 
pend on the nucleon mass only kinematically through the 
invariant x / . Dynamical target mass corrections can en- 
ter only through the proton wave function whence 
the exphcit dependence of (p on mj^ in Eqs. piI)) - PT|) . 
From now on we will suppress such dependence for ease 
of notation. For higher twist terms, the situation is more 
complicated, because the equations of motion may induce 
dynamical correlations between lower- and higher-twist 
terms [l3|, but we will not discuss this issue here. 

Structure functions are obtained by suitable projec- 
tions of the tensors in Eq. PU)) . see Appendix [Bl In this 
paper, we choose the helicity basis to perform the pro- 
jection of the W'^'^ and H'^'^ tensors. The transverse and 
longitudinal structure functions read 



FT,L{xB,Q^,m]^) 



E 



dx . 



•f\T,L{^f,Q'^)^f{x,Q'^) ■ 

(12) 



The advantage of the helicity basis is that in the right 
hand side there are no kinematic prefactors, which would 
appear when considering the Fi 2 structure functions, as 
discussed in Ref. [26| and reviewed in Appendix [Bl 
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Applying the factorized Eq. (jlOp without paying atten- 
tion to the kinematic limits on x, which have been un- 
derstood in Eq. ^TU\i . and using Eq. one would obtain 
what we call the "naive" TMC in coUincar factorization: 

F^lixB,Q^m%) = 4"l(e,Q') , (13) 

where F^^^j^ are the structure functions as they would be 
defined and computed in the massless nucleon limit by 
setting mf^ — from the beginning. Indeed, the partonic 
structure functions hf^^.L a-re independent of the hadron 
target, and are defined in the same way for the massive 
and massless nucleon cases. As a consequence of the fact 

that F^\{y,Q'^) has support over < y < 1, the target 
mass corrected can be different from zero in the 

kinematically forbidden region 1 < xb < 1/(1 — m^/Q^). 
The appearance of such an unphysical region is also a 
feature of the OPE approach [ij, [3 ' discussed in the 
introduction. Eq. p3)) has been introduced in Ref. (26|] 
and compared to the OPE approach in Refs. [l^, • 

In fact, a closer examination of the handbag diagram 
kinematics reveals that there is no unphysical region. Let 
us consider the handbag diagram in the right hand side 
of Fig. [l] and limit the discussion to on-shell light quarks 
or gluons, = 0, in both the initial and final states. 
The general case of off-shell partons, including heavy 
quark production is discussed in Appendix [XI Because 
of baryon number conservation, the net baryon number 
must flow either into the target jet or into the current 
jet. We shall separately examine these two cases. If the 
net baryon number flows into the target jet (bottom part 
of Fig. [5]), the jet invariant masses satisfy m| = > m'j 
and Py > 'm%- Let us consider the invariant momentum 



square of the process, s = {p+q)'^ = {pj+py)^- Since the 
2 jets are made of on-shell particles, pj • _py > 0. Hence, 
s > rrij + mj^. In summary, the current jet mass must 
satisfy 

< m| < s - TO^ . (14) 

Since s — = {1/xb — 1)Q^, Eq. p4)l guarantees that 
the handbag diagram is non-zero only when xb < 1, as 
it must be on general grounds because of baryon number 
conservation, irrespective of the model used to compute 
the process. On the other hand, if the net baryon num- 
ber flows into the current jet (top part of Fig. [J). The 
invariant jet masses satisfy m| > and py > 0, so 
that 

m\i < < s , (15) 

which again guarantees that the handbag diagram re- 
spects the < 1 limit. Within the coUinear factoriza- 
tion approach, the momentum of the active quark en- 
tering the short-distance hard part that generates the 
current jet is approximated to be on mass shell, fc^ = 
mj^. That is, the baryon number is very likely to 
flow into the target jet for the factorized contribution to 
the DIS cross section, and Eq. (fT4)) gives the relevant 
limits on m|. Using m'j = {k + q)"^ = {l/xf — l)Q^ and 
Xf ~ £^/x m Eq. (fT4| . we obtain 

XB<Xf<l, (16) 

which implies the following limits on the dx integration 
in Eq. (HID: 

C < x < — (17) 

XB 
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- R/R<o) 




FIG. 4: Comparison of prescriptions for NLO target mass 
sections, R — ctl/ctt ~ Fl/Fi. The ratio TJ/i?'"' is plotted 
computed using MRST2002 parton distributions 

Eqs. (fT6| - p7|) explicitly guarantee Ft,l = if a;^ > 
1, so that there is no unphysical region for target mass 
corrected structure functions: 

FT,LixB,Q'^,m%) 

= / hf\T.L(Xf,Q )'Pf(X,Q ) . 

J ^ X 

Eq. is our formula for calculating DIS structure func- 
tions with the TMC. As expected, it has the hadron mass 
dependence explicitly in the integration limits caused by 
the DIS kinematics and implicitly from the hadron states 
in the definition of the PDFs. The naive structure func- 
tions (|13|) are obtained when considering a; < 1 as upper 
integration limit in Eq. (|18|) . This limit is a general and 
process-independent consequence of the definition of a 
parton distribution in the field theoretic parton model 
[27| . but in DIS it is weaker than x < ^/xb, which is in- 
duced by 4-momentum and baryon number conservation. 
In the massless target limit, m^/Q^— >0, the constraint 
(fT7)) reduces to xb < x < 1, and we recover the massless 
structure functions as we should expect: 

FT,LixB,Q^m%) .4°|(xb,Q2) . (19) 

In Fig. [3l we plot the ratio of the TMC corrected F2 to 
the massless Fj"-* , with TMC computed using the analog 
of Eq. p^ . sec Appendix lB 3[ the naive prescription 
and the Georgi-Politzer prescription. The corrections are 
in general quite large at = 2, but still non negligible 
at the generally considered "safe" scale = 25 GeV^. 
From the right panel of the figure, one can estimate how 



-I I I I I I I I I I I I I I I I I I I I I I I I- 



R/R(o) - 

RGP/R(0) 




Q2 [GeV2] 

corrections to the ratio of the longitudinal and transverse cross 
as a function of xb and Q^. The structure functions have been 



large should be to safely neglect TMC. At xb < 0.5 
the TMC are smaller than 5% if > 10 GcV^. However, 
at larger xb, one may need to go to > 100 GeV^ for 
TMC to become small. Note also the difference between 
F2 and ^2"^, which is smaller than 30-40% at = 2: it 
gives the size of the contribution of the unphysical region 
^/xb < X < 1, which has to be subtracted from the naive 
structure function. 

The difference between TMC of F2 (and similarly of 
Fi_t) in coUinear factorization and in the Georgi-Politzer 
formalism is smaller than 15-20% at the lowest scale, and 
rapidly disappears at larger scales. So one is tempted to 
brush aside the question of what formalism is correct, if 
willing to accept this level of uncertainty. However, the 
situation completely changes when considering F^ , or the 
ratio R of the longitudinal to transverse cross section, 

R=--^^ ^^°^ = %)^ (20) 

whose TMC/massless ratios are plotted in Fig. 31 (Note 
a factor of 2xb with respect to other common conven- 
tions, see Appendix lB 21 ) The TMC of i? are much larger 
than for F2 . Most importantly, the difference between the 
coUinear factorization and Georgi-Politzer TMC is huge, 
up to a factor 10 (5) at Q^=2 (25) GeV^! Therefore, 
one has to decide which formalism to use. This is espe- 
cially important for a fit of the gluon PDF, to which Fl 
is sensitive. 

It is also important to note that our formula for TMC 
in Eq. (fT5)) explicitly eliminates the kinematically forbid- 
den region 1 < xb < 1/(1 — ™Ar/Q^) because of the in- 
tegration limits on the parton momentum fraction x. As 
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FIG. 5: DIS handbag diagram at leading order in as- 



— > 1, structure functions calculated by using Eq. 
approach to zero, the kinematic limit, smoothly, except 
for the lowest order contribution, whose partonic struc- 
ture functions are derived from the tree level handbag 
diagram in Fig. [5l Indeed, by explicit computation at 
tree level in the approximation of massless quarks, we 
obtain 

\^T{xf, Q") = \ ej 5{xf -l)^^e}xSix-0 , (21) 

where e/ is the electric charge of the parton /. Sub- 
stituting the tree-level partonic structure function into 
Eq. (dHl), the lowest order contribution to the transverse 
structure function, 

1^ xb > I , 

remains positively finite when — s- 1 and does not van- 
ish as xb^I, as shown by the dashed line in Fig. [SI 

This problem exists only at the lowest order and arises 
because of the 5-function behavior of the partonic struc- 
ture function (|2ip and the assumption that the final-state 
is made of a massless quark, m'j — 0, as shown in Fig. [S] 
The (5-function bypasses the kinematic constraint from 
the integration limits in Eq. ((18]) and forces x = ^(xb), 
which exhibits the mismatch between the phase space for 
x at the parton level and that for ^{xb) at the hadron 
level. Under the coUinear approximation the momentum 
fraction x for a massless parton can be as large as 1, 
while the plus momentum fraction of the virtual photon 
^(xb) smaller than 1 for a finite target mass mjy. As a 
result, the perturbatively calculated structure functions 
do not vanish at xb = 1 bec ause the PDF s are finite at 
X = £_{xB = 1) = 2/(1 + ^1 + 4TO^/g2) < 1. As we 
will discuss in the next section, this explicit phase space 
mismatch at the lowest order could be improved if the 
single massless quark final-state in Fig. [5l which is not 
physical, is replaced by a jet function as shown in Fig. [T] 

We conclude this section by stating that if one is per- 
forming global QCD fits of the PDFs in the context of 
pQCD coUinear factorization, our formalism in Eq. 
might be the most consistent way to treat TMC, because 
it expresses the long distance physics of structure func- 
tions and the leading target mass correction in terms 
of PDFs that share the same partonic operators with 
the PDFs of zero hadron mass. Moreover the structure 
functions calculated using our formulae do not have the 




FIG. 6: Transverse structure function plotted as a func- 
tion of xb, with and without target and jet mass corrections, 
computed with only light quarks at lowest order in a, using 
MRST2001LO parton distributions [sB]. The dotted line is 
the massless structure function. The dashed line includes only 
TMC, and corresponds to Z = 1 in Eq. (|65|l . The solid line 
corresponds to JMC coming only from the continuum part p 
of the jet spectral function, Z — in Eq. (|65p . JMC are com- 
puted using a log-normal spectral function with (m^) = 0.3 
GeV^ and standard deviation a^2 — {rrij). 



Xb > 1 unphysical region and vanish at the xb = 1 
kinematic limit except for the lowest order contribution 
that will be discussed further in next section. The same 
coUinear factorization formalism can be easily and con- 
sistently extended to semi-inclusive DIS measurements 
and hadronic collisions, for which the OPE formalism is 
not applicable, but which are included in global QCD 
fits of parton distributions. Careful analysis of kinemat- 
ics and conservation laws will guarantee that no unphys- 
ical region appears in these observables, as well. The 
obtained formulae will not merely be an approximation 
to the TMC for those processes, as argued in [25j . 
but will give the correct answer in the context of pQCD 
coUinear factorization. 



III. JET MASS CORRECTIONS 

In this section we discuss the possibility to include a jet 
function into the lowest order contribution to have a more 
realistic kinematic constraint on the "single quark" final- 
state [iBl . Hopefully, we can reduce the unphysical posi- 
tive value of structure functions at xb = 1. As discussed 
in the last section, this is caused by the 5-function behav- 
ior of the partonic structure functions, the assumption 
that the final-state is made of a massless quark, m? 0, 
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FIG. 7: DIS handbag diagram at leading order in Os gener- 
alized to include a jet function J{1) beside the target function 
f{k). 



and the mismatch between the phase space for ^{xb) and 

X. 

The assumption that the leading order final-state is 
made of a massless quark, m'j = 0, is clearly unphysical 
because the quark has to hadronize due to color con- 
finement, so that the current jet will have an invariant 
mass m|. Then, we may heuristically set mj = rrij 
for the cut quark line, and substitute 6{xf — 1) with 
6{xf ^ 1/(1 + m2/Q2)) in Eq. 1^: 

h,\Tixf,Q')^- e} X S{x - e(l + ■^)) . (23) 

Furthermore, we may assume that the current jet has an 
invariant mass probability distribution Jm(™j) normal- 
ized to 1, and accordingly smear the structure functions 
in miD: 



Fr 



dm'^Jmimj) / — hf\T{xf,Q'^)(pf{x,Q'^) 
X 







A. Collinear factorization with a jet function 

We aim at including in the DIS handbag at leading or- 
der in as a suitable jet function to take into account the 
invariant mass of the jet produced by the hadronization 
of the struck quark, see Fig. [T] Note that in computing 
the DIS cross section with the handbag diagram of Fig. [3 
we are making several assumptions. First, we are assum- 
ing that it makes sense to separate the final state into a 
current jet and a target jet, respectively the top and the 
bottom blob. Because of color confinement, this separa- 
tion can only make sense as an approximation, and is jus- 
tified for inclusive and semi-inclusive cross sections if the 
rapidity separation between the 2 jets is large enough. 
This is in general the case at asymptotically large Q^. 
However, at finite Q^, the rapidity difference between 
the 2 jets tends to as xs^l, and the struck quark may 
participate in the hadronization process together with 
the unstruck target partons. Thus, we need to take care 
in estimating the range in xb in which the handbag dia- 
gram is a meaningful approximation to the DIS process. 
The second assumption we make, intimately related with 
the first one, is that color neutralization of the current 
jet happens via the exchange of soft momenta, which we 
can neglect when discussing 4-momentum conservation. 

In order to obtain a collinear factorization formula, we 
will closely follow the procedure of Ellis, Furmanski and 
Petronzio UM- The hadronic tensor is 



(2^ 



Tr[f(fc)7''J(07l K(fc,p,g) . 

(27) 



(24) 



where we considered only 1 flavor for simplicity. The sum 
over quark flavors will be restored at the end of the com- 
putation. We use a hat to denote a matrix in Dirac space. 
The trace over color indexes can be easily factorized and 
included in the target function [l^ . The remaining trace 
is over Dirac indexes. The target function T is defined 
as 



If J™(m2) is a sufficiently smooth function of mf, we [Tik]] = ^ 5^'^^ (p " ^ - 51 P') I ^^'l^' ^) I 



obta 



I'rp (xb,Q ,m 







(25) 



= I d*ze"-''{p\i>,{z)MO)\p) , 



(28) 



The jet mass corrections (JMC) so introduced are of 
order Oirrij /Q'^). It is easy to see that in the limit 

(m|), the massless F^^ decouples from the in- 
tegration over the jet mass, and we recover the structure 
function with TMC: 



-^FT(xB,Q^m^) . (26) 



where {k,Y\ = {Y\ are all possible final states 

originating from the target fragmentation, and (fc| is a 
parton state of momentum k. Analogously, the jet func- 
tion J is the non-perturbative quark propagator: 



Y ieY 

f4 ^^iz-. 



(29) 



In the following, we will discuss how to put this Ansatz 
on a more firm theoretical basis. 



and {l\ is a quark state of momentum The jet momen- 
tum is constrained by momentum conservation to I = 
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k + q, but it is useful to keep it explicit in our formulae. 
The function K is included to impose the kinematic con- 
straints, the non-trivial one being a;™'" ^ Xf < x/min, 
see Appendix [XI 



K{k,p, q) = e{k+ + q+)e{k- + q-) 

X e(p+ - k+)e{p- - k-) 

X 9{xf — xb)0{1 ~ Xf) , 



(30) 



where, for light quarks. 



X 



xb 



1 



(31) 



i-fc2/g2 



To obtain the leading power contribution, we expand 
T(k) in terms of Dirac matrices and neglect terms that 
depend on the vector defining the direction of the gauge 
link in the PDFs, which are suppressed by powers of l/Q^ 

f(A:) -ri(fc)i + r2(fc)^ + T3(A:)75+T4(fc)^75, (32) 



and, analogously, we expand J as : 



(33) 



For massless quarks, Ti = 0. The terms T^^A^ which are 
proportional to 75 cancel when computing unpolarized 
cross sections. In pure QCD, jj,,/^ = because of par- 
ity invariance, and ji only enters in traces with an odd 
number of 7 matrices, hence does not contribute. We 
are left with the terms proportional to T2 and j2- The 
dominance of the k~^ and l~ components of k and I in 
the Breit frame suggests to define 



T2(k) = 



1 

1 
1 



Tr[^^f (fc)] 

d4ze*"-'=(p|^(z)7+V(0)b) 



(34) 



J2(0 = T7^Tr[^J(/)] 

(35) 

After these manipulations, the hadronic tensor reads 
dk+dk-(fkT 



W^^^ip.q) 



(36) 



X ^Tr[|^777''] j2(0T2(fc)K(fc,p,g) , 



where 



k^ = xp+n^' + ^ T' n^ -I- k^ 
2xp+ 



(37) 



^' = ^--^)P^^' + {^^ + ^>' + k>^- (38) 



For later use, let us also define 



-i7r(fc,0 = /Tr[^777^] 



(39) 



Our goal is to obtain a factorized expression for the 
hadronic tensor in terms of coUinear parton distribution 
functions, see for example Eq. (|lip . For this purpose we 
need to let J dk~d^kT act only on r2(fc), which defines 
the coUinear PDF modulo factors of 2. In doing this we 
will be forced to make approximations on the momenta 
entering and exiting the hard scattering vertex, viz., k 
and /. In principle, one would like to avoid it and allow 
approximations in the computation of the hard scattering 
tensor only [l^. In this way, one can ensure that the 
final state obeys 4-momentum conservation, and avoid 
potentially large errors in region of phase-space close to 
the kinematic boundaries. While in most cases this is not 
a problem for inclusive cross sections, it might become 
very important for exclusive observables. In our case, 
we want to compute the inclusive DIS cross section at 
large xb^I in coUinear factorization: in order to extend 
the validity of our computation as close as possible to 
this kinematic boundary, we need to pay attention to the 
approximations we will make, keep them at a minimum, 
and estimate the range of validity in xb and of the 
approximations we will have to make. 

The first step in the coUinear factorization of the 
hadronic tensor p6p . is to expand H^'^ around the mo- 
mentum of a coUinear and massless quark: 



HrikJ)^Hrik,l) + 



dm 

~dk° 



-(fc"-fc") + 



where 



k^ = xp'^n^ 



(40) 



(41) 



The higher order terms in the expansion are suppressed 
as powers of A'^/Q'^, where A^ is a hadronic scale, and 
contribute to restore gauge invariance in higher-twist di- 
agrams 14]. In this paper, we will retain only the lead- 
ing twist term of the expansion. Note that we did not 
yet make any kinematic approximation: in principle, one 
may sum over as many higher-twist terms as desired. 

The second step involves using the spectral represen- 
tation of J (28j to explicitly introduce the invariant jet 
mass in the formalism: 



J{1) = / dm^ [Ji(m^2)I+ J2(m^2)f]27r(5(/2 - m2)6'(/°) , 
Jo 

(42) 

where the spectral functions Ji{m'j) are positive definite 
and normalized to 1: 



dm'^ Ji[m'^) 



(43) 
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In particular, by substituting Eq. p5|l into (|35l) . we ob- 
tain 

/•OC 

Ml) = / dm^ J2(m?) 27r5(/2 - m^.) 9{l°) , (44) 



so that we can interpret rrij as the jet invariant mass, 
and J2{'m'j) as its probability distribution. 

In the light-cone gauge n- A = 0, the jet spectral func- 
tion is related to the non perturbative quark propagator: 



d'*ze*^-'Tr[7~(0|V'(^)V'(0)|0)] 



(45) 



Computations of the non-perturbative quark propaga- 
tor have been performed in lattice QCD [2^ and using 
Schwinger-Dyson equations, see [30| for a review. How- 
ever, there are several difficulties in extracting informa- 
tion relevant to the jet spectral function from these com- 
putations: (i) the quark-antiquark correlator appearing 
in (I45p is typically computed in the Landau gauge in- 
stead of the light-cone gauge, (ii) computations are per- 
formed in Euclidean space instead of Minkowski space, 
(iii) one needs to extract the spectral representation from 
the computed correlator. The biggest problem is that the 
analytic structure of the quark propagator is not suffi- 
ciently well known to either perform the analytic contin- 
uation back to Minkowski space or to extract its spectral 
representation [sol. [sH. As a way to avoid this problem, 
it would be interesting to see if it is possible to rotate the 
whole handbag diagram, including its external momenta, 
to Euclidean space as done in [s^ for the computation of 
the hadronic contribution to the muon anomalous mag- 
netic moment. In this way one would be able to directly 
use the lattice propagator in the computation of the for- 
ward Compton amplitude. Alternatively, one may try to 
use the light-cone QCD formulation on the lattice dis- 
cussed in |3»^. which exploits the Hamiltonian formula- 
tion of QCD in order to remain in Minkowski space. A 
more phenomenological approach to the spectral function 
will be discussed in the next subsection. 

The third step involves our first kinematic approxima- 
tion. In order to factorize j2 from the dk~d^kT integra- 
tions, we need to approximate l^l, so that 



(46) 



Then, the hadronic tensor reads 



Wf"'{p,q)= I dm]j2(m]) I dk+ H^" {k, I) S{1^ - m^,) 



dk d'^kr 
(2^)^ 



2r2(fc) K{k,p,q) 



(47) 



where 0{l^) = 9{k^ + q^) is already included in the kine- 
matic constraint function K(fc,p, g). We can expect the 



approximation (|46p to be reasonable in a region where 
j2{l) does not vary strongly with In terms of the spec- 
tral representation, this requirement is satisfied if the in- 
tegral in Eq. (|47p is dominated by values of m| close to 
where the jet spectral function has a maximum. We will 
discuss below the conditions on x_b and for which this 
condition is satisfied. Note that this kinematic approx- 
imation only acts on the (5-function in Eq. (|47p so that 
>^2(w|) has been left unapproximated: in this sense the 
approximation is the mildest possible compatible with 
coUinear factorization. 

The fourth step involves decoupling K and T2 in 
Eq. ((47|) . It can be achieved by replacing 



K{k,q,p)^K.{k,q,p) = e{xf~XB)e{l~Xf) . (48) 



Note that 6'(fc" + q^) = 0{p+ - k+) = 6{p- - k' 
because of the constraints on Xf. In terms of x, 

i < X < S,/xB ■ 



(49) 



This is a delicate step because it involves approximating 
the kinematic constraints, such that the integration over 
kx and k^ are unbounded. This clearly is not a good ap- 
proximation as xs— +1 Tg*!, in which case the struck par- 
ton carries most of the nucleon plus-momentum, so that 
the minus and transverse components cannot be large. 
To appreciate this, consider 



(50) 



where py is the total four momentum of the target jet. 



and we define m 
kinematics of Eq. ([3]) , we obtain 

1-e 



> 0, see Fig. [2 Using the full 



(51) 



On the other hand, in the center-of-mass frame, py — —I 
and py.T ^It = ^kx, so that 



0\2 



s = (p^ + n 



(52) 



ip'yr 



kl 



(P)2)' 



> 4A:. 



Combining these 2 results, we obtain 



krp <^ 



4C 



(53) 



As xb^I, ^~^£,th ^ 1 so that the (1 — £,) factor tends to 
close the available fcr phase space. In Section IIII C[ we 
will discuss in which region of xb and we may in fact 
neglect this bound. Using the definition of T2, the 
hadronic tensor reads 



W>"'{p,q) - / dmU2{m^,) 



(54) 



dx 



Hr{k,l)S(P-m'Mx,Q-') , 
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where the quark PDF Lpq is defined as in Eq. ((TT|) . 

As a last step, we define an on-shell and massless jet 
momentum for the partonic tensor. 



= i- 



and replace 



-Tr[^777^] 



(55) 



(56) 



This is needed: (i) to ensure that q^H^^ = 0, hence the 
gauge invariance of the hadronic tensor, and (ii) to al- 
low use of the Ward identities in proofs of factorization 
[l6| . This approximation, made on the hard scattering 
coefficient, is less critical then the kinematic approxima- 
tions previously discussed because it does not change in 
itself the kinematics of the process. It is analogous to 
the approximation taken in considering the usual hand- 
bag diagram of Fig. [5] with a massless quark line joining 
the 2 virtual photon, except that it approximates only 
the computation of the Dirac traces. 

Finally, we define the LO hard scattering tensor 

(fc, g, m2) = Hr(k,T)S{? - m^) (57) 

= Tr[]^777l'5(P-m2) , (58) 

which for mj = coincides with the LO hard scattering 
tensor computed for a diagram without jet function, as 
in Eq. (|10p . The hadronic tensor can then be written in 
factorized form as 



(59) 



X 1^ (fc,g,m2)^,(x,g2) , 



which is the central result of this section. The transverse 
structure function reads 



/•oo 

Jo 



X 2_. / — hf\Tixf,Q'^,m'^)ipq{x,Q'^) , 

(60) 

with ipq defined in Eq. (jlip . The longitudinal structure 
function Fl = because = 0. An explicit compu- 
tation gives hxixfjQ'^) = ^e^jxS(^x — ^(1 -I- m^/Q^)) so 
that at LO 



FrixB.Q^.mjf) 



(61) 



= ^ ^^'V|j2(m,^)4")(^e(l + 9^),Q^) , 

Note that when where (m|) = 

J dm'j rrij J2{rrrj) , the massless i^^"' decouples from the 
integration over the jet mass, and we recover the TMC 
to the LO structure functions. 



B. The jet spectral function 

Let us discuss more in detail the properties of the 
nonperturbative quark spectral function J2, defined in 
Eq. Let us start from the definition of the j2 com- 

ponent of the jet function, 

j2(0 = 4j^Tr[7-J(0] 



Y,s^'\i-Y,p,){o\^fmY)j-{Y\i:fmo) 



(62) 



with / the quark flavor. For simplicity, we consider only 
light quark flavors with m/ <C TOtt. The color c of the 
quark operator ip is not neutralized, so that it must ap- 
pear in the final state \Y). In the physical process, we 
are assuming that the struck quark's color is neutralized 
by a soft gluon exchange with the target's remnant. We 
also assume that we can neglect the soft exchange for the 
purpose of evaluating the change of kinematics induced 
by the inclusion of a quark jet function on the lowest or- 
der contribution to the inclusive DIS. This assumption is 
likely valid if the jet and target rapidities are sufficiently 
separated. However, this might not be the case close to 
the kinematic limit = 1, and the approximation will 
break down as we shall soon see. Because of color con- 
finement, we may assume that no more than 1 particle in 
the final state is colored, all the other ones binding into 
colorless hadrons. The colored particle must be a quark, 
to match the quark operator's color, and we denote it by 
\q'fi)- Hence, the final state is made of 1 quark plus an 
arbitrary number of hadrons, the lightest of which is a 
pion: 



|y) = |gj.,)IM...|M 



(63) 



with N > and /' = / when = 0. The spectral 
function J2, defined in Eq. ([331), can be written as 



J2(m2) = Z5(m2 - ml) + (1 - Z)p{m^^) , (64) 

where the (5-function is due to the contribution of the 
single particle \qfi), and < ^ < 1 28]. The continu- 
ous and positive definite function p is the contribution 
of multiparticle states with A^ > in Eq. ([63)1 and is 
normalized to 1 because of Eq. Due to the as- 

sumption p has a bell shape: it is equal to up 

to = (m^ -|- niq)^ « m^, increases up to a maximum 
and then tends to as m|— s-oo to satisfy the normaliza- 
tion condition. Using Eq. in (pT|) . we obtain 

FT{xB,Q^m%) ^ Z fP {^,Q^) 
Z) 



(1 



dTO^/9(m^)F^"Me 1 



(65) 



Setting Z = 1 is equivalent to calculating the standard 
handbag diagram without the jet function, and one re- 
covers the TMC formula. 
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FIG. 8: Effect of jet mass corrections on Ft, computed with a toy jet spectral function as described in the text. Plotted is 
the ratio of Ft with both TMC and JMC to Ft with only TMC included, as a function of xb for = 2 and 25 GeV^ The 
shaded band corresponds to a log-normal jet mass distribution with {m|) = 0.2 — 0.4 GeV^ and a^2 = {m|) — 2{m|). The 

dashed and dot-dashed lines corresponds to delta functions at = and rUj — m%, respectively. 



The first term in Eq. ([55)1 shows that the introduction 
of the jet function in the handbag diagram goes some 
way toward softening the problem with the unphysically 
positive Ft at XB = ^, but does not solve it. The rea- 
son is that we cannot kinematically neglect the effect 
of the color neutralizing soft interactions when we com- 
pute the hadbag diagram close to xb = 1, where the 
rapidity difference between the current and target jets 
is becoming smaller and smaller. A full solution to this 
problem is the inclusion of a "soft function" , in addition 
to the target and jet functions, which describes the soft 
exchanges in the context of fully unintegrated correlation 
functions ^Td\ . The soft function has essentially the effect 
of smearing the jet function, avoiding the singular behav- 
ior displayed by the (5-function. For a phenomenological 
inclusion of the soft function in coUinear factorization, 
we can substitute J2 with a continuous function Jm such 
that 

J™(m2) > , (66) 

J 

because of phase space, and 

Jrnimj) ~ J2{m]) . (67) 

It can be physically interpreted as the (smeared) jet 
mass distribution, analogously to J2, and we will call 
it smeared jet spectral function. The structure function 
is then computed as in the Ansatz discussed at the be- 



ginning of the Section: 

FT{xB,Q'',ml) (68) 

We note that the jet spectral function J2 is defined as 
a quark correlation function in vacuum, therefore it is 
process-independent. On the other hand, the smeared 
jet function J,„ is process-dependent because it effec- 
tively includes the soft momentum exchange with the 
target. As a result, it's shape at m| < might de- 
pend on Xb and Q^. However, the average jet mass 
squared, {mj)m ~ drrij nij Jm{rnj) should exhibit a 
small sensitivity on xb and because we may expect 
(m|) » m^, see Section UlI CI Eq. (155)) is a reasonable 
approximation to the full handbag diagram computation 
in the region of {xb,Q^) phase space where the integra- 
tion over dm'j in Eq. ([5T)l extends well beyond the peak 
of the continuum p(m|), namely if 

^-^Q' > (m^), , (69) 
Xb 

where 

/"OO 

{m))p= dm]mp2{m]) . (70) 

In these conditions, the structure function ()68p is not 
much sensitive to the behavior of the jet function at small 
m^, where Jm may substantially differ from J2. 
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For practical applications of JMC to global QCD fits 
of the PDFs, it is necessary to develop a flexible enough 
and realistic parametrization of the smeared jet spectral 
function J^- For this purpose, one may try to use a 
Monte Carlo simulation of DIS events in order to gen- 
erate enough data and test possible parametrizations. 
One may also study the invariant jet mass distribution 
in e+ + e~— >jets events, where the same jet function J 
discussed in this Section appears in the LO cross-section. 
However, these studies lie outside the scope of this paper, 
and we leave them for the future. 



C. Numerical estimates 

In order to obtain an estimate for the magnitude of 
JMC and of the present theoretical uncertainty, we em- 
ploy a toy model for the jet spectral function. Let's 
consider a bell-shaped smooth function such as the log- 
normal distribution 



f{x]fi,a) = 



1 



x(t\/2tt 



exp 



(logx - 



2(t2 



(71) 



where 



log hf + 1 



(72) 



and X and cTx are the average value of x and its standard 
deviation. Then, we can parametrize the continuum part 
p of the toy jet mass distribution in Eq. (|6ip in terms of 
the average jet mass {m3)p and its standard deviation 



with 



(T = (J™ 2 



(73) 



(74) 



in units of GeV^ . From the typical particle multiplicity of 
the current jet at the JLab energy, we estimate (m^)p — 
0.2 — 0.4 GeV^, and assume cr,„2 = C{'m'^)p with C = 
1 - 2. 

In Fig. [HI we plot the JMC to the transverse struc- 
ture function as obtained in Eq. ([6T|) by neglecting soft 
momentum exchanges. The dashed line corresponds to 
Z = 1, and is equivalent to computing only TMC. The 
solid line corresponds to JMC coming only from the con- 
tinuum part p of the jet spectral function, i.e., Z = 0. For 
comparison, the massless structure function is plotted as 
a dotted line. The true jet mass corrected Ft should lie 
somewhere in between because < Z < 1, in general. 
With the smearing due to soft interactions, see Eq. 
the true Ft will tend to as xb^I. 



1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I j 
g |_ {mf)=0.3 GeV2 | 
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FIG. 9: Range of validity of kinematic approximations used 
in deriving TMC and JMC. The solid line corresponds to 
Eq. (|75|l with (mj) — 0.3 GeV^; the dashed line corresponds 
to Eq. JTB)) and dZZI, with kl^^^ = 0.04 GeV^ 



mj)p value can be 



The sensitivity of JMC to the 
gauged from Fig. [H where we plotted the ratio of the 
Z ~ 1 TMC-only structure function to the Z = jet 
mass corrected structure function. For comparison, we 
also use p{m'^) = 5{m?^ — m^) and p{m'^) — (5(m^ — m^), 
considered as extreme cases of JMC. In the absence of a 



better knowledge of the value of Z and 



the overall 



theoretical uncertainty on JMC can be quite large, es- 
pecially at low — 2 GeV^, and is still non- negligible 
at ^ 25 GeV^. At moderate xb < 0.6 it is of the 
same order of magnitude of the TMC corrections to the 
massless Ft- 

Finally, we want to estimate in which region of xb and 
the kinematic approximations involved in step 3 and 
4 of the factorization procedure are expected to be valid. 
As we discussed after Eq. (|T7)) . replacing j2{l) with j2{l) 
makes sense only if the integral over drri^ is dominated 
by « "^j|max' "^hsre the spectral function J2 has a 
maximum, hence minimal slope. Looking at the integra- 
tion limits in Eq. (|68p , and noticing that for a probability 
distribution with the properties of the jet spectral func- 
tion it is typically true that ni^^^^^ ^ (m'^) 5, (m-^^ 



where (m 
ing condition: 



roc 
JQ 



dmj mj J„ 



(m 



we obtain the foUow- 



l- Xb ^ 
Xb 



> 



(75) 



Note that it coincides with the condition (|69p that insures 
we can indeed approximate J2 ~ Jm for the computation 
of inclusive DIS cross section. It also guarantees some 
rapidity separation between the current and target jets, 
which is needed to justify the handbag diagram in the 
first place. The approximation of step 4 consisted in ne- 
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glecting the integration limits on dk~ and (Phx- For the 
transverse momentum, we need at least to make sure that 
the average (k^) is well below the upper limit derived in 
Eq. (ESI): 




The difficulty is that we cannot estimate (fc|,) within 
coUinear factorization. To do this, we would need to 
resort to unintegrated PDF [s^, [3^ , which are still inte- 
grated over dk~ , or to the more recently proposed fully 
unintegrated PDF p^ . For a rough estimate, we may 
use the uncertainty principle and set the minimum trans- 
verse momentum k^^^^ — l/r^ « 0.04 GeV^, where 
is the nuclcon radius. pQCD evolution will then broaden 
it roughly according to 

(4) = fcT.„i„[l + Cl0g(QVfcTmi„)] , (77) 

with C a constant of order 1. The borders of the confi- 
dence region for the discussed TMC and JMC are plot- 
ted in Fig. [9] using the above estimate for {k'^), and 
(m2) =0.3 GeV2. 

IV. SUMMARY AND CONCLUSIONS 

In the first part of this paper, we computed the tar- 
get mass corrections to unpolarized DIS structure func- 
tions in the context of coUinear factorization. Because of 
the non-perturbative nature of target mass, we empha- 
sized that for any factorizable hadronic observable, the 
TMC can only appear explicitly in kinematic variables 
and implicitly in definitions of non-perturbative hadron 
matrix elements. The momentum space approach al- 
lowed us to avoid the ambiguities related to the moments 
inversion which affect the OPE treatment of Georgi and 
Politzer. In particular, we could respect 4-momentum 
and baryon number conservation, and obtain TMC cor- 
rected structure functions without unphysical contribu- 
tions at xb > 1- When performing global QCD fits of the 
PDFs in the context of pQCD coUinear factorization, the 
procedure presented in this paper might be the most con- 
sistent way to treat TMC, because it expresses the long 
distance physics of structure functions, and the leading 
target mass correction, in terms of PDFs that share the 
same partonic operators with the PDFs of zero hadron 
mass. Hence it allows to unambiguously separate the 
kinematic effects of the target's mass from its dynamical 
contribution to parton matrix elements and the PDFs. 

Our formalism for TMC in Eq. (fT8|) is valid at lead- 
ing twist and any order in as. Calculating TMC for the 
power-suppressed higher-twist contributions to the struc- 
ture functions is a non-trivial [l^ but important issue for 
measuring the size of parton correlations in the nucleon 
wave- function, which we leave to a future effort. The 
leading-twist formalism can be easily extended to polar- 
ized DIS structure functions [37] , for which a correct eval- 



uation of TMC is even more important than in the unpo- 
larized case because the bulk of available data is in fact 
in the large-xs domain. The extension to semi-inclusive 
DIS and to hadronic collisions is also very important, in 
order to fully include TMC in global PDF fits. An ex- 
ample is the Drell-Yan cross-section at large Feynman 
xf, which has the potential to further constrain large-x 
PDFs [38]. It is also straightforward to extend the TMC 
analysis to DIS on nuclear targets, in order to include the 
effects of nucleon binding and Fermi motion [39| . This 
is especially important for studying the large- a; neutron 
PDFs and the d/u ratio, which are extracted from data 
taken with a Deuterium target. 

In the second part of the paper, we examined the im- 
pact of a final-state jet function on the extraction of 
PDFs at large xb- We proposed to write the leading or- 
der hadronic tensor, hence the lowest order contribution 
to DIS cross section, in terms of the spectral representa- 
tion J2 of the jet function, which has the physical mean- 
ing of invariant jet mass distribution. We evaluated the 
impact of JMC on the leading order DIS structure func- 
tions, and found it to be potentially large even at not so 
small values of photon virtuality such as = 25 GeV^. 
In the NLO cross-section, the impact of JMC is likely to 
be reduced, because a non-zero jet invariant mass can be 
produced in the hard scattering beyond tree level, but 
is still potentially large. We also evaluated the range of 
validity in xb and of the approximations we made. 

For practical applications to global fits of PDFs, it is 
important to investigate the shape and properties of the 
smeared jet spectral function J^, which effectively in- 
cludes the neglected soft momentum exchanges in the 
final state. This can be phenomenologically done us- 
ing a Monte-Carlo simulation and then trying several 
parametrizations of Jm- In a more fundamental ap- 
proach, we noticed that the jet spectral function J2 is 
related to the non-perturbative quark propagator, which 
can be computed in lattice QCD or using Schwinger- 
Dyson equations. To avoid the difficulties connected to 
the analytic continuation to Minkowski space, one may 
try and rotate the whole handbag diagram to Euclidean 
space, or use a Hamiltonian-bascd formulation of lattice 
QCD. 

In conclusion, the obtained results on TMC and JMC 
will be very important when using large-xs and low-Q^ 
data on DIS structure function (like those obtained at 
Jefferson Lab) to extract reliable PDFs at large-x, and 
to disentangle kinematic effects from the dynamically in- 
teresting higher-twist parton correlations. The discussed 
extensions of our formalism to other processes will allow 
a fuU inclusion of TMC and JMC in global QCD fits of 
parton distribution functions. 
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APPENDIX A: KINEMATIC CONSTRAINTS AT 
FINITE 




FIG. 10: Lowest order couplings of a generic vector boson 
(7, W^, Z) to a parton of flavor / and mass m/. The masses 
of the produced quarks are mi and 7712. Left: boson-quark 
scattering (m2 = 0). Right: boson-gluon fusion. 



Let us consider the handbag diagram for a DIS process 
on a nucleon target, as depicted in the right hand side of 
Fig. [TJ We repeat the kinematic analysis of the handbag 
diagram performed in Section |TT1 but for the general case 
of an off-shell bound parton of momentum k, and fc^ < 
f . The limit of on-shell quarks of mass m'i, relevant to 



/• 

collinear factorization, can be obtained setting k'^ — mj 
and Xf —Xf in the formulae below. 

We consider the scattering of a generic vector boson 
(7, W^, Z) on a parton of flavor / of mass m/. The low- 
est order couplings are displayed in Fig. [TOl The masses 
of the quarks (other than /) coupled to the vector boson 
are mi and TO2- The current jet mass must satisfy 

(Al) 



nij > sth 



where 



Sth = (toi + . (A2) 

As discussed in Sec.|TTl the net baryon number is likely to 
flow through the bottom of the handbag diagram for the 
leading DIS contribution that is given by the collinear 
factorization formalism. Therefore, 



(A3) 



Using m'j = {q + fc)^ = fc^ + (l/a:/ — l)Q^ we obtain 



xb 



<Xf< 



(A4) 



1 - XBkyQ^ - ' - 1 + (sth - A:2)/Q2 ■ 

Using = [Q^ + lp){^~l)^ Eq. (|^ can alternatively 
be expressed as limits over the fractional momentum x — 
k+/p+: 

where 

.g'+Sth-fc2+A[fc2,-02,Sth] 



A[a, b, c] = y/a'^ + b'^ +6^ - 2{ab + bc + ca) 
We finally note that 

£ 1 



x^ 



(A6) 



(A7) 



APPENDIX B: INVARIANT AND HELICITY 
STRUCTURE FUNCTIONS 

1. Helicity structure functions 

We work in a collinear frame and for generality we keep 
the quark mass different from zero. For collinear on-shell 
partons we have from Eq. ^ 



2pi 



9" = -^V^n^ 
k^ — x'p^n 



_Q1 

2 



(Bl) 



■M I L 



2xp+ 



where to/ is the mass of the parton of flavor /. For later 
use, we define the shorthands 



2 "tjv 



PS -1 + 4x^-^2 
where, as in Eq. 



1 + 4a; 



2'^ 



Xb 



-q 



2p-q 



2k ■ q 



(B2) 



(B3) 



Following [26[, we define the longitudinal, transverse 
and scalar polarization vectors with respect to the virtual 
photon momentum q and a reference vector p, 



-q^ip ■ q)p^{p, q) 



4(p,g) = -^(0,±l,-z,0) 



where 



(B4) 



p\p,q)^l-p\^/{p-qf 



(B5) 
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It is immediate to verify that q) = p% and /9^(fc, q) = 
pj:. The polarization vectors satisfy the following condi- 
tions 



ex • £a' = for A 7^ A' 
ex ■ e\ = 1 for A = 0, - 

' — 1 



(B6) 



and, in particular, q ■ eo = q ■ e± = 0. The helicity 
structme functions -Fa arc defined as projections of the 
hadronic tensor: 

FxixB,Q^) - PriP, q)W^.ip, q) (B7) 



with A = L,T,A,S.,{Qq},[Qq]. The longitudinal, trans- 
verse, axial, scalar, and mixed projectors P^'^ are 

Pf{p, q) = q)e'';{v. q) + q)e^*{v. q) 

PTip^ 9) = qV+ip^ 1) - ^-ip^ iV-~*ip^ 1) 

Pi;-{p,q)^e>i{p,q)e-;*{p,q) (^8) 



Using 



£+{p,q)£+*ip,q) - £-ip,q)£-*{p,q) , . 
4(p.'?K*(p>9) + ^-(p>'7)£-*(p.'?) 

= -5'^"^ + (P, (P, 9) - £^ (P, (P, 9) , 

(B9) 

one easily sees that 
Ft{xbM'') = -W[^{p,q) + FLixB,Q^) - F,{XB,Q^) 
Fa{xb,Q') = 7 ^ ^M^(P^g) • (BIO) 

(P • gjPB 

Even if not apparent from Eq. (jB4[) . a consequence of the 
normalization conditions is that the reference vector has 
the only function to define the t — z and transverse planes 
in conjunction with q^^: as long as it lays in the t—z plane, 
a different reference vector defines the same polarization 
vectors [2^. For example, e'^{p,q) = e'^{k,q). As we will 
see, choosing k instead of p is convenient when defining 
the parton level helicity structure functions, which read 

hxixf, Q2) ^ p^^-(k^ q)n^,,{k, q) (Bll) 
and satisfy identities analogous to Eq. (|B10p . with p— >fc. 

2. Invariant structure functions 



with i = 1, . . . , 6 by the following tensor decomposition 
of the hadronic tensor: 



W^^'^{p,q) ^ g^'' + 'L^)f,{xbM^) 
+ (p' 



,M _ ^pPli) (p- _ q-Pll) F2{XB,Q 
q^ / \ (p- ) p ■ q 



le'^^'^'^Paqp 7^Fi{xB,Q ) 

p-q q 



Fz{xB-,Q )H Fe[XB,Q ) 



2p-q 
and 

n^^k,q)^{ 



2p-q 



(B12) 



9'"' + ^)hiixf,Q^) 



k-q q" 



k^^q'^+q^k- fcMg^-gMfc^ 

-h^(xf,Q )H ^ he[Xf,Q ) . 



2k ■ q 



2k -q 



(B13) 



These 2 definitions differ from the notation of Ref. (26[ 
in the chosen denominators. Our definitions have the 
advantage of displaying a duality between the hadron and 
parton level, which can be obtained from each other by 
exchanging p ^ fc, and lead to a lesser degree of mixing 
between the hadron and parton structure functions under 
collinear factorization, see Eq. (|B23p . By applying the 
projectors (|B8l to Eqs. (|B12p - (|B13p . it is straightforward 
to show that 



Fl = 


Fl + ^ F2 

2XB 


hL - 




Fx — 


2Fi 


Ht = 


2hi 


Fa^ 


PbFs 


hA = 


Pfhs 


Fs = 


Fi-F^ 


hs = 


hi — 


F{Oq} 


= -rbF^ 


h{Oq} 




F[Oq] ^ 


= -PbFq 


hoq] 


= -psK , 



(B14) 



where we understood the dependence of Fi\ on {xb,Q^) 
and of hix on {xf, Q^) for ease of notation. Note that F^ 
differs by a factor of 2xb with respect to other common 
conventions. In our notation, the ratio R of transverse 
and longitudinal electron-nucleon cross sections reads 



For a generic lepton-hadron scattering, we define the 
hadronic Fi and partonic hi invariant structure functions 



„ TT Fl 



(B15) 



16 



3. Collinear factorization for structure functions 

As discussed in Section and Appendix \^ the 
collinear factorization theorem states that 



where 



X 1 _ sun. 



xb 



1 ~ XBmyQ"^ 
1 

1 + (sth - m2)/Q2 



(B16) 

(B17) 

(B18) 
(B19) 



The corresponding limits of integration on dx, namely 



can be read off Eq. 



6]) setting — m^. 



As discussed in Appendix IB H Pi^''{p,q) = P^''{k,q), 
hence the factorization theorem for helicity structure 
functions reads 



Fx{xB,Q^,m%) 

= E 



^h{{xf, 



E 



^hi{xf,Q')^f/j,[j-,Q^^ 



(B20) 



where 



2xf 



+ ^l + Axjmj/Q^ 



(B21) 



The last line of Eq. (|B20p is particularly interesting, be- 
cause the Nachtmann variable ^ only appears in the ar- 
gument of if, without touching the integration limits. In 
shorthand notation, where we highlight the dependence 
on and ^ and suppress that on and Q^, and un- 
derstand the sum over /, the helicity structure functions 
read 



Fx{xb) = h{(g,ipf/NiO 



(B22) 



For the invariant structure functions, kinematic prefac- 
tors often appear: 



FiixB) 
F2{xb) 

F3,5,6{^b) 
FiixB) 



Pb 

Pb 

Pf 



PB 



(B23) 



The "massless structure functions" can be obtained by 
setting m% = 0, hence, £_ = xb in Eqs. (|B22ll - (|B23p : 



F\!h^B) = Fx4xb)L2^-o 



(B24) 



In this definition we left the quark mass m / arbitrary. 

The "naive" target mass corrected structure functions 
F"^ are obtained by using a; < 1 as an upper limit of 
integration over dx in Eq. (|B20|) . This limit is a general 
and process-independent consequence of the definition of 
a parton distribution in the field theoretic parton model 
[27| . but in DIS it is weaker than x < x™'^^, which is in- 
duced by 4-momentum and baryon number conservation 
as discussed in Section HIl In detail, the naive helicity 
structure functions read 

FrixB) = (B25) 
—hx{xf,Q^)^f/N{x,Q') (B26) 

Jj-min X 



Using the definition of massless structure functions, one 
finds 



F^'lixB) = F^^liO 



jTinv 
-^3,4,5 



(xb) 
(xb) 



1 Xb „(o) 



pI e 



-F. 



(0) 



3,4,5 



PB 

Fi'Ho 



(B27) 



^F^'\0 



PB 



These formulae have already appeared in [2J, |25|, 
[26j . modulo the change of notation discussed in Ap- 
pendix IB 21 As already noted in the main text, they 
are non-zero in the unphysical region xb > 1- 



4. Structure functions with Jet Mass Corrections 



At LO, the helicity structure functions with jet mass 
corrections read 



FxixB,Q^,rn%) 



" dmp,{m^)Fi'>\^{l+myQ%Q^) . 

(B28) 

The JMC to invariant structure functions can be ob- 
tained from Eqs. (|B28p and (|B14p . Suppressing the 
and dependence of the structure functions for ease 
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of notation, we obtain: 



^1 \xb) 



^2 \^B) 



1 Xb 



dm,J2{mA 



p|^(l + m2/g2) 



xF(°)(e(l + m?/Q^)) 



1 



■^3,5,6 (a;^) - — 



l-Xg „2 



d™^ J2(rn^) 

Pb Jo 

dm^J2 



(e(i + -,VQ^))}. 



(B29) 



APPENDIX C: TARGET MASS CORRECTIONS 
IN THE OPE FORMALISM 

We collect here for completeness the target mass cor- 
rections to the electromagnetic structure functions ob- 
tained in the operator product expansion formalism of 
De Rujula, Georgi and Politzer [l3[il|, see also [ll| for 
a thorough review and discussion: 



Fr{^B,Q') 



Xb 

PB 

„2 
-^B 

Pi 

Xb 

PB 



Ff\(..Q^) 



e 



rrij^XB 

Wp^ 

m%XB 

Q^PB 

^ rnjfXB 
Q^PB 



A2{XB,Q^ 



A2ixB,Q^) 



A2(xs,Q") 
(CI) 



where 



= / dv 



Q'^PB 



F^iv^Q') 



(C2) 



and i^f ^ are the perturbative structure functions com- 
puted in the massless target approximation mj^ /Q^^O. 
Formulae for the F^J^ structure functions can be found 
in Ref. [40^. 

Note that in the notation of Appendix [Bl differently 
from Ref. (isj . the longitudinal structure function is de- 
fined such that 



Fl{xb) - t^F2{xb) - Fi{xb) , 
2xb 



and 



R{xb) 



_ (tl{,xb) _ Fl{xb) 



(C3) 



(C4) 



arixB) Fi{xb)' 
This notation is explained in detail in the Appendices of 
Ref. [Hi. Combining Eqs. jCH and (IC4| we obtain 



FiixB) 



P% F2{xb) 
2xB 1 + R{xb) 



(C5) 



in agreement with Ref. [l^| . 

Equations (jCip have been used to compute the OPE 
target mass corrections in Figs. [3] and ID Note that both 
Fj^^ and F^^^ receive a correction from an integral of 



Fj*'"' ^ F^"l. This explains the large size of the target 
mass corrections for the OPE curves of Fig. 21 
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